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CROSS-REFERENCE TO RELATED APPLICATIONS 
[0001] This application is based upon and claims the benefit of priority from the prior 
Japanese Patent Application No. 2003-039741, filed on February 18, 2003, the entire 
contents of which are incorporated herein by reference. 

BACKGROUND OF THE INVENTION 
Field of the Invention 

[0002] The present invention relates to an information processing unit, particularly, the 
present invention is suitable for application to an information processing unit having a 
plurality of processors on one chip, which are able to operate in parallel. 

Description of the Related Art 

[0003] An SIMD-(Single Instruction Multiple Data)-parallel processor is one type of 
parallel computer. The SIMD parallel processor has a plurality of processors on one chip, 
executing a process corresponding to an instruction supplied thereto in parallel 
simultaneously with the plural processors. 

[0004] Each processor in the SIMD parallel process consists of a processor element and a 
memory. In the SIMD processor, since a process corresponding to the supplied instruction is 
to be carried out in parallel by all processors, the memory of each processor holds only data. 

[0005] Therefore, though the SIMD processor has plural processors, it is not able to 
execute different kinds of processing with each processor in parallel. For example, 
while executing a retrieval process with some processors, the SIMD processor can not 
execute a detailed comparing process with other processors in parallel, in a processing 
of matching data and so on. 

[0006] In order to execute different kinds of processing in parallel with the plural 
processors, it is necessary that the program (instruction) should be stored in the memory of 
each processor. However, in the instruction set used for a general-purpose processor in recent 
years, a single instruction is organized into 32-bits or 64-bits. If the memory of each 
processor holds the program by which the same processing as done in a general-purpose 
processor is executed, high memory capacity is required. However, in order to provide many 
processors in one chip, the memory capacity of each processor is preferred to be small. 
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SUMMARY OF THE INVENTION 
[0007] In view of the above, it is an object of the present invention to enable a plurality of 
processors provided on one chip to execute various and different kinds of processing in 
parallel, while minimizing the increase of the memory capacity required to store a program in 
each processor. 

[0008] An information processing unit according to the present invention is an 
information processing unit having a plurality of processors on one chip, which is 
characterized by plural processors that can execute instructions independently. Each 
processor includes a decoder circuit for determining the instruction to be uniquely executed, 
based on an input history of instruction codes, by use of an instruction code input to the 
processor from a plurality of instructions which are assigned to the instruction code. 

[0009] According to the present invention, since one instruction code corresponds to a 
plurality of instructions, the instruction code length can be shorter than the case where 
different instruction codes are made to correspond to the plurality of instructions, so that 
different kinds of processing can be performed by the same instruction code. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0010] Fig.l is a block diagram showing a configuration of a parallel processor to which 
an information processing unit according to an embodiment of the present invention is 
applied; 

[0011] Fig.2 is a view explaining the concept of the processing operation in each 
processor according to the present embodiment; 

[0012] Fig.3 is a block diagram showing a configuration of an instruction decoding 
portion; 

[0013] Fig.4 is a view showing an example of a look up table for setting a rule for 
changing a group code; and 

[0014] Fig.5 is a view showing an example of a group change determination circuit. 

DETAILED DESCRIPTION OF THE INVENTION 
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[0015] An embodiment of the present invention will be described hereinafter in reference 
to the drawings. 

[0016] Fig.l is a block diagram showing a configuration of a parallel processor 10 to 
which the information processing unit of an embodiment according to the present invention 
applies. 

[0017] The parallel processor 10 is composed of plural processors 1 1 which are connected 
to each other and contained on one chip. Each processor 1 1 has a memory 12, an instruction 
decoding portion 13 and a plurality of processor elements (PE) 14, each of which is able to 
execute a certain instruction independently. The memory 12 includes a program memory 
(instruction memory area) for storing a program (instruction) and a data memory (data 
memory area) for storing data. 

[0018] The instruction decoding portion 13 decodes the instruction, reading out of the 
program memory portion of the memory 12. Also the instruction decoding portion 13 
controls an internal register described later and the processor elements 14 based on the result 
of decoding. A detailed description of the instruction decoding portion 13 will be given later. 
The processor elements 14 are conventionally known SIMD processor elements, executing 
operations according to control signals supplied from the instruction decoding portion 13. 

[0019] Though the processors 1 1 each have 8-processor elements 14 as in the example 
shown in Fig.l, each processor 1 1 is able to have optional numbers of processor elements 14, 
for example, a single processor element will be acceptable. 

[0020] Fig.2 is a view explaining the concept of the processing operation in each 
processor 1 1 . In the following description, one instruction code is organized to have 8-bits, 
and a group code described later is organized to have 3-bits. 

[0021] In Fig.2, 21 is an instruction queue, being composed of plural instruction memories 
so as to hold plural instruction codes. Each instruction memory has an 8-bit memory area. 

[0022] The instruction decoding portion 13 includes a group register 22 and an instruction 
decoder circuit 23. The group register 22 is a register for storing a group code, having 3-bit 
memory area. The instruction decoder circuit 23 decodes the instruction code, with reference 
to the group code to execute a control operation according to the decoded result. 
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[0023] The elements 24-j , where "i" is a subscript, i = 1 - n (n is an optional natural 
number), are processor elements equivalent to the processor elements 14 shown in Fig.l. 

[0024] The group code designates one of the groups into which the input instructions 
concerning the processing executable in the processor 1 1 are sorted based on a predetermined 
rule. The group codes have different values from each other in every group. 

[0025] The input instructions are sorted into groups, such as the instruction for the four 
basic operations of addition, subtraction, multiplication, and division. They also include a 
shift operation or a bit operation, and an instruction for a WTA operation used for matching 
data and the like, a burst operation instruction, a register operation instruction, a memory 
control instruction, a flow control instruction, a scalar instruction and so on. The instruction 
codes of respective input instructions are different from each other in one single group, but 
the same instruction code can exist in another group. Namely, plural instructions from 
different groups can be expressed by one instruction code. 

[0026] In addition, there exist global instructions which does not belong to any group 
(which are independent of the groups). The global instructions are, for example, a group 
change instruction for changing the group code, a group temporary change instruction for 
changing the group code of immediately following instruction, an alias execution instruction 
for executing an optional instruction previously assigned and the like. 

[0027] The operation will be described as follows. 

[0028] The instruction decoder circuit 23 read an instruction code ICD, one byte (8-bits) 
at a time out of the instruction memory of the instruction queue 21. The instruction decoder 
circuit 23 adds a group code GR supplied from the group register 22 to the read-out 
instruction code ICD and decodes them. Namely, the instruction decoder circuit 23 decodes 
an 1 1-bit code (hereinafter referred to as the "internal instruction code") consisting of the 3- 
bit group code GR and the 8-bit instruction code. 

[0029] When the instruction is a standard instruction, for example, an instruction for one 
of the four basic operations and the like as a result of decoding, the instruction decoder circuit 
23 supplies a control signal INS corresponding to the instruction to each of the processor 
elements 24-j. When the instruction is to change the group code, such as the group 
(temporary) change instruction as a result of decoding, the instruction decoder circuit 23 
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writes the thus-changed group code GRS designated by this instruction into the group register 
22. 

[0030] Fig.3 is a block diagram showing an example of a configuration of the instruction 
decoding portion 13. 

[0031] In Fig.3, numeral 31 designates an instruction memory for holding the instruction 
code; processor elements 32 are used for executing various kinds of processing according to 
control by the instruction decoding portion 13. 

[0032] The instruction decoding portion 13 includes an instruction bit linkage part 33, an 
alias instruction decoder 34, an alias instruction register 35_j , where "j" is a subscript and an 
optional natural number, a first selector 36, a group change instruction decoder 37, a standard 
instruction decoder 38, an alias change instruction decoder 39, a plural-cycle instruction state 
register 40, a group register 41, a second selector 42, and a group temporary memory register 
43. 

[0033] The instruction bit linkage part 33 reads the instruction code out of the instruction 
memory 31 one byte at a time. This instruction bit linkage part 33 is configured, based on the 
instruction code which has been input in the past, so as to output an internal instruction code, 
by linking (adding) of the group code held in the group temporary memory register 43 to the 
instruction code read out of the instruction memory 3 1 . 

[0034] The alias instruction decoder 34 determines whether the internal instruction code 
supplied from the instruction bit linkage part 33 is an alias execution instruction which 
replaces the internal instruction code with the code of the alias instruction, or not. Depending 
on the determination, when the internal instruction code is an alias execution instruction, the 
alias instruction decoder 34 reads the alias instruction (the internal instruction to be replaced) 
out of the alias instruction register 35.j corresponding to the group code of the internal 
instruction code so as to output the instruction. 

[0035] The alias instruction registers 35_j are set in every group, in which the internal 
instruction codes of the instruction of other groups assigned as the alias instruction are stored 
respectively. As for the alias instruction registers 35. J} they are not limited to only those in the 
embodiment shown in Fig.3, but also to another configuration that has a common alias 
instruction for all groups, or a still another configuration that has plural alias instructions in 
each group. 
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[0036] The first selector 36 selectively outputs the internal instruction code supplied from 
the instruction bit linkage part 33, or the internal instruction code supplied from the alias 
instruction decoder 34. 

[0037] The group change instruction decoder 37 changes one group code into another new 
group code, when the internal instruction code supplied from the first selector 36 is a group 
change instruction or a group temporary change instruction. In addition to the group 
(temporary) change instruction, the group change instruction decoder 37 can change the 
group code into a new group code when the internal instruction code supplied from the first 
selector 36 satisfies the rule for changing the group code which has been previously defined. 
The rule for changing the group code can be defined with, for example, a look up table (LUT) 
44, which relates the internal instruction code to the group code to be changed. The look up 
table is provided in the group change instruction decoder 37. 

[0038] The group change instruction decoder 37 writes a new group code value into the 
group register 41 in the case of changing the group code based on the group change 
instruction or the rule for changing the group code. Hereby, the value held in the group 
register 41 is replaced with the new group code value. Further, the value held in the group 
register 41 is supplied to the group temporary memory register 43 through the second selector 
42 to be held therein. 

[0039] When only the group code of an immediately following instruction is to be 
changed based on the group temporary change instruction, the group change instruction 
decoder 37 supplies the subject group code value to the group temporary memory register 43 
through the second selector 42. Hereby, the value held in the group temporary memory 
register 43 is replaced with a new group code value. In this case, the group change instruction 
decoder 37 does not write the group code into the group register 41. 

[0040] The standard instruction decoder 38 decodes the internal instruction code and 
outputs a control signal to the processor elements 32 in the same way as in a conventional 
processor, when the internal instruction code supplied from the first selector is a standard 
instruction (such as the four basic operations). 

[0041] The alias change instruction decoder 39 rewrites the value of the alias instruction 
register 35.j, when the internal instruction code supplied from the first selector 36 is an alias 
change instruction code.. Specifically, the alias change instruction decoder 39 rewrites the 
value of the alias instruction register 35.j corresponding to the group code, the change of 
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which is indicated by the internal instruction code, into the internal instruction code (the 
group code as well as the instruction code) of the instruction code to be replaced according to 
the internal instruction. 

[0042] The plural-cycle instruction state register 40 controls, when the alias instruction or 
the like replaced by the alias execution instruction is an instruction which occupies more than 
a 2-byte length, the first selector 36 to replace only the first byte of the instruction with the 
alias instruction and ignore the alias instruction over the second byte. 

[0043] The plural-cycle instruction state register 40 therefore controls the first selector 36 
so as to only select the first byte of the internal instruction code from the alias instruction 
decoder 34 and subsequently select the internal instruction code from the instruction bit 
linkage part 33 over the two bytes. That is why the first byte of the instruction indicates the 
instruction itself and the instruction over two byte may be data or the like, when the alias 
instruction or the like has more than a two byte length. 

[0044] Incidentally, the group change instruction decoder 37, the standard instruction 
decoder 38, the alias change instruction decoder 39 and the plural-cycle instruction state 
register 40 communicate with each other during operation. 

[0045] The second selector 42 selectively supplies the group code value held in the group 
register 41 or the group code value supplied from the group change instruction decoder 37 to 
the group temporary memory register 43, according to the control of the group change 
instruction decoder 37. Specifically, the second selector 42 selects and outputs the value from 
the group change instruction decoder 37 only in the case where the group code is changed 
according to the group temporary change instruction. Otherwise, the second selector 42 will 
select and output the value from the group register 41. 

[0046] Though the instruction decoding portion 13 is not shown with a flow control 
circuit and the like such as a program counter, a register, a conditional branch, those can be 
provided therein. 

[0047] Fig.4 is an explanatory view showing an example of the LUT 44, which sets the 
rule for changing the group code. 

[0048] As shown in Fig.4, the rule for changing the group code, as one group, is to be 
defined with an instruction mask IMk , where "k" is a subscript and optional natural number. 
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The following two entries, an instruction code ICk and a changed group code GCk have the 
same subscript. The instruction mask IM k and the instruction code ICk are defined by 1 1-bit 
equivalent to the group code (3-bit) and the instruction code (8-bit), and the group code GCk 
is defined by the group code only (3-bit). 

[0049] The instruction mask IMk sets a bit to be masked by the input instruction (internal 
instruction code). The instruction mask IMk defines O(Zero)" to a bit to be masked, and also 
"1" to a bit not to be masked. 

[0050] The instruction code ICk defines the internal instruction code to be compared with 
the input instruction. The instruction code ICk defines O(Zero)" as a bit to be masked and 
also defines a certain value for the bits not to be masked by the instruction mask IMk. The 
group code GCk defines the changed group code to be set newly, in the case that the group 
code agrees with a condition set by the instruction mask IM k and the instruction code ICk. 

[0051] For example, as shown in Fig.4, when the values "11 1 10000000", "101 10000000", 
"101" are set for the instruction mask IM1, the instruction code IC1, and the group code GC1 
respectively, each of the 4 high order bits of the instruction mask IM1 is "1". When the value 
of the 4 high order bits of the input instruction is compared with the value of the 4 high order 
bits of the instruction code IC1, if in agreement, the group code will be changed into "101". 
In the case that the value of the 4 high order bits of the input instruction is "1011", the group 
code will be changed into "101". 

[0052] Similarly, when the values "l 1 1 1 1000000", "1 101 1000000", "110" are set to the 
instruction mask IM2, the instruction code IC2, and the group code GC2 respectively, the 
group code will be changed into "110" if the value of the 5 high order bits of the input 
instruction is "11011". 

[0053] The LUT44 shown in Fig.4 is just one example, and the present invention is not 
limited to it. In another example, an LUT, wherein the plural groups of the instruction mask 
and the instruction code and one changed group code are set as one group, and the group code 
will be changed according to plural input instructions including the previous input 
instruction, can be acceptable. 

[0054] Fig. 5 is a view showing one example of a group change determination circuit for 
determining whether the input instruction (internal instruction code) satisfies the rule for 
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changing the group code shown in Fig.4, or not. This determination circuit is provided, for 
example, in the group change instruction decoder 37. 

[0055] As shown in Fig. 5, in the determination circuit, the logical multiplication operation 
between one bit IN m , where "m" is a subscript, and an integer between 0-10, in the input 
instruction and one bit MB m in the instruction mask MSk is performed at a logical 
multiplication operation (AND) circuit 5 1 for each corresponding bit. Further, an exclusive 
negative OR operation between the operation result of the AND circuit 5 1 and one bit IB m in 
the instruction code CODE is performed at an exclusive negative OR operation (EX-NOR) 
circuit 52 for each corresponding bit. By means of an AND circuit 53 connected in series, the 
logical multiplication operation of all operation results of each EX-NOR circuit 52 is carried 
out so as to output the operation result thereof as a selecting signal SEL. 

[0056] With the determination circuit thus being configured, the selecting signal will be at 
a high-level only in the case where the input instruction agrees with the condition of the 
changing rule set by the instruction mask MSk and the instruction code CODE. As a result, 
the changed group code set by the said changing rule will be selected. 

[0057] According to the present embodiment as described above, in the parallel processor 
10 having the plural processors 11, the instruction decoding portion 13 in each processor 1 1 
adds the group code which is determined a the past instruction code to the instruction code 
which is read out of the instruction memory so as to generate the internal instruction code. 
Further, the instruction decoding portion 13 performs various kinds of controls, uniquely 
determining the instruction to be executed, based on the internal instruction code, from 
among the plurality of instructions assigned to the instruction code which is read out. 

[0058] Accordingly, the plural instructions correspond to one instruction code to express 
each of the instructions with a short instruction code length, so that the increase of the 
capacity required for the program memory holding the instruction code can be lowered. 
Moreover, different kinds of processing can be performed with the same instruction code, 
according to the group code determined by a past instruction code. As a result, various and 
advanced processing can be executed compared to hitherto available processors. Besides, by 
setting the alias instruction to which an optional instruction is assigned, the instruction of 
other groups can be executed immediately by one instruction code. 
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[0059] In the above described embodiment, the instruction code is defined by an 8-bit 
code, and the group code is defined by a 3 -bit code, however, this is only a preferred example 
and the present invention is not limited to this example. 

[0060] The present embodiments are to be considered in all respects as illustrative and not 
restrictive, and all changes which come within the meaning and range of equivalency of the 
claims are therefore intended to be embraced therein. The invention may be embodied in 
other specific forms without departing from the spirit or essential characteristics thereof. 

[0061] As described above, according to the present invention, without a need for 
different kinds of instruction codes corresponding to every instruction which can be executed 
in the processor, one instruction code corresponds to a plurality of instructions and each 
processor determines the instruction to be executed uniquely, based on the input history of 
instruction codes, from the plural instructions by the instruction code. Thus, every instruction 
can be expressed by short instruction code length so that the increase of memory capacity 
required to hold the instruction code at each processor is well lowered. Also, if the same 
instruction code is inputted, different kinds of instructions can be executed according to the 
input history of the instruction code, as a result, various and different kinds of processing can 
be executed by the plurality of processors in parallel. 
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